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PROTEIN EXPRESSION VIA SEED 
SPECIFIC REGULATORY SEQUENCES 

FIELD OF THE INVENTION 
This invent ion relates to seed specific expression casettes.for 
use In regulation and expression of proteins In plants. 

BACKGROUND OF THE INVENTION 
In many plant species, seed storage proteins represent the 
predominate source of amino acids and nutrition for the germinating 
plant. It Is well known that genes which code for storage proteins 
are strictly regulated. Seed storage proteins are produced in large 
quantities at a determined stage of development; expression is both 
tissue specific and developmental ly controlled. These proteins are 
generally localized in specialized organelles called protein bodies 
or protein storage vacuoles and are essentially protease free. 
15 Because the seeds of many plants can readily synthesize large amounts 
of these storage proteins it is believed that the biological machin- 
ery within seed cells can be put to use synthesizing proteins of 
importance to pharmacological and Industrial needs. Thus, charac- 
terization of the storage proteins as well as the ability to direct 
protein synthesis toward the protein bodies Is of continuing ex- 
perimental, commercial, and nutritional interest. 

The storage proteins of the major grain crops have been par- 
ticularly well studied. In the legumes the major seed storage 
proteins are the globulins, represented by two types: the nonglyco- 
sylated 11 S proteins and the glycosylated 7 S fraction. In the 
common bean (Fhaseolus vulgaris) about 50% of the total seed protein 
is represented by a group of three polypeptides (the a. 0, and 7 
subunits) collectively known as phaseolin. In soybean (Glycine max) 
the major storage protein in known as 0-conglyeinin and its subunits 
30 (a, a\ and fi) represent up to 80% of total seed protein in that 
genus. 

In corn (Zea mays), a representative monocot, the protein zein 
constitutes about 80% of corn seed protein. While there are many 
zein-related genes responsible for zein production, one type, the $- 
2 ins, are represented in very low copy number and yet account f r 
10% f total seed protein. 

However, the seeds of many dlcot and monocot plants are defl- 
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dent in certain ess ntial amin acids which require th addition of 
these amin acids when these seeds are used as the predominate source 
£ protein in the human r animal di t. For a number f years 
investigators have attempted to couple the high level expression of 
seed storage proteins with recombinant engineering techniques to 
produce plants with enhanced nutritional value. For instance, while 
legume seed contain high levels of storage protein, the protein is 
deficient from a nutritional standpoint in that the globulins are 
relatively low in sulfur-rich amino acids, making legume seed a 
nutritionally incomplete foodstuff. Recombinant techniques could be 
used to increase the amount of sulfur in the seed storage proteins or 
direct synthesis of heterologous sulfur-rich proteins. The develop- 
ment of seed specific promoter cassettes for monocotyledons plant 
species is also important since the seeds from these plants are 
widely used as a food source for animals and humans. Thus, in a 
similar manner. zein, which is lacking in the essential amino acid 
lysine, can be made nutritionally complete. 

On an industrial scale, such techniques can be used to direct 
heterologous proteins or enzymes into the storage vacuoles. As 
protease-free zones within the seed, such expression would serve as 
an abundant and easily purified source of protein. Thus, plant seed 
could provide an alternative means for producing hormones and enzymes 
needed for commercial and industrial use. 

By using genetic engineering techniques it is already possible 
to transfer genes into many plant species; direct expression of 
homologous (native to the host) and heterologous (foreign) plant 
genes is well established. Numerous reports indicate that seed 
storage protein genes have been transferred to other plants with 
expression in both a tissue specific and developmental^ regulated 
manner. Beachy. et al.. EMBO 4:3047-3050. (1985). as well as into 
protein bodies. Greenwood and Chrispeels. Plant Physiol. 79-65-71 
(1985). 

The expression of non-plant genes in plants has also met with 
success, but on a more limited level. It has been suggested that the 
difficulty in attaining expression of non-plant proteins is due to a 
failure of the plant splicing machinery to recognize introns (also 
called Intervening sequences) present in the non-native transcribed 
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pre-mRNA, K. Hartmuth and A. Barta, Nucleic Acids Res. 14: 7513-28 
(1986), since removal of introns is necessary in order for the mRNA 
to be functional. S e, M. Green, Annu. Rev. Genet. 20: 671-78 (1986). 
For instance, Barta, et al, Plant Mol cular Biol. 6:347-357 (1986), 
report construction of a chimeric gene in which the complete human 
growth hormone (hCH) gene is flanked by DNA fragments coding for the 
promoter and polyadenylation sites of the nopaline synthase gene. 
When the chimeric gene is introduced into sunflower and tobacco 
callus tissue the gene is transcribed to the pre-mRNA stage. However, 
the transcript is non- functional in the plant tissue as the pre-mRNA 
is not further processed by the plant to mRNA. Thus the foreign 
protein, hGH. is not synthesized. 

An alternative to direct expression of foreign proteins is to 
make a gene construct wherein the gene encoding for the desired pro- 
tein is inserted within a nonfunctional region of a plant protein. 
European Patent Application 319 353 discloses a method for expressing 
a desired protein by modification of the hypervariable region of a 
determined storage protein by an insert encoding the desired peptide. 
In this way a modified storage protein is expressed which has 
contained within it amino acids which code for the foreign protein. 
The foreign protein is then cleaved by enzyme digestion, isolated and 
purified. However, the insert must be in reading phase with the non- 
modified part of the storage protein on either side of the insert, 
thus effectively limiting the size of the foreign protein which may 
be expressed to the length of the hypervariable region of the storage 
protein. 

The present invention overcomes the problems outlined above by 
providing for seed specific expression of a desired protein compris- 
ing fusion of a gene with a seed specific gene regulatory sequence. 
INFORMATION DISCLOSURE STATEMENT 

The present invention uses a gene amplification technique which 
Is a modification of the polymerase chain reaction (PCR) method 
described in U.S. Pat. 4,683,195. 

European Patent Application 255 378 provides nucleotide se- 
quences for seed specific transcriptional regulation. However, the 
reference does not teach the translation cassettes of the present 
invention. 
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Eur pean Patent Application 295 959 refers t DNA expression 
cassettes for transcript! n of the brazil nut sulfur- rich 2S seed 
storage protein. The reference neither teaches nr discloses the 
translational expression cassettes of the present invention. 

European Patent Application 319 353 refers to a process for the 
production of useful biologically active polypeptides through the 
modification of plant genes encoding S2 albumins and is discussed 
above . 

An East German Patent Application. DD 263-081-A, reports a 
process for the production of useful proteins via a vector containing 
a regulatory sequence isolated from a plant gene. 

Barta, A., et al. -The Expression of Ropaline Synthase-Human 
Growth Hormone Chimaeric Gene in Transformed Tobacco and Sunflower 
Callus Tissue". Plant Molecular Biol. 6:347-357 (1986). report 
construction of a chimeric gene containing the complete human growth 
hormone (hGH) gene flanked by the promoter and polyadenylation site 
of the nopaline synthase gene and is discussed above. 

Beachy. et al, -Accumulation and Assembly of Soybean 0-Con- 
glycinin in Seeds of Transformed Petunia Plants", EMBO, 4:3047-3053 

(1985) , report the accumulation of the a'-subunit of the soybean seed 
storage protein 0-conglycinin in the seeds of transformed petunia 
cells. The reference does not teach or disclose the cassettes of the 
present Invention. 

P.P. Chee. et al, -Expression of a Bean Storage Protein 
•Phaseolin Minigene' in Foreign Plant Tissues", Gene, 41:47-57 

(1986) , report that stable and functional phaseolin mRNA can be 
produced in tobacco by the use of an mutant phaseolin gene lacking 
the native introns. 

The gene coding for the 0-zein 15Kd protein and regulatory 
regions are used in this invention. The sequence and isolation of 
this sulfur-rich zein protein is disclosed in Pedersen, et al, 
-Sequence Analysis and Characterization of a Maize Gene Encoding a 
High-Sulfur Zein Protein of M r 15.000". J. Biol. Chem. 261:6279-6284 
(1986). 

Sengupta-Gopalan, et al. "Developmental^ Regulated Expression 
of the Bean ^-Phaseolin Gene in Tobacco Seed". Proc. Natl. Acad. USA. 
82:3320-3324 (1985). report the transformation of tobacco with a 3.8- 
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kB phaseolin DNA fragment containing the coding region and flanking 
DNA necessary for c rrectly regulated expressi n f the gene. 
However, the present invention uses specific linkage sequences not 
taught by the reference. 
5 The complete nucleotide sequence of the French bean seed storage 

protein phaseolin is disclosed in Slightom, et al, "Complete Nucleo- 
tide Sequence of French Bean Storage Protein Gene: Phaseolin", Proc. 
Natl. Acad. USA, 80:1897-1901 (1983). Some of the methods reported 
therein are used in the present invention. 
10 SUMMARY OF THE INVENTION 

The present invention provides a seed specific expression 
cassette which has a promoter derived from either phaseolin, a'- 
subunit of 0-conglycinin, or 0-zein 15Kd; a translation initiation 
signal from either phaseolin, the o'-subunit of 0-conglycinin, fi-zein 
15 15Kd, or an animal gene; a gene derived from brazil nut sulfur-rich 
seed storage protein or an animal gene; a translation termination 
region derived from either the animal gene, phaseolin, or 0-zein 
15Kd; and at least one polyadenylation region from phaseolin, the 
animal gene, or 0-zein 15Kd; wherein the regulatory sequences are 
operably linked to one another in such a manner that the gene Is 
expressed In seed or seed storage protein bodies. The invention 
further provides enzyme restriction sites incorporated into the 
sequences which link the regulatory elements and genes. The inven- 
tion also provides that certain of the regulatory elements be derived 
25 from the same genetic source. 

More particularly, the present invention provides an expression 
cassette in which the promoter is from phaseolin or a'-subunit of 0- 
conglycinin, the animal gene and remaining elements are from bovine 
somato- tropin, and the restriction sites contained within the linkers 
30 are Ncol and Hindlll. 

More particularly, the present invention provides an expression 
cassette In which the promoter is from phaseolin or a'-subunit of 0- 
conglycinin, one poly-A signal is from phaseolin, the remaining 
regions, including a second poly-A site, are from bovine somatotropin 
35 and the restriction sites are Ncol and Hindlll. 

Most particularly, the present invention provides an expression 
cassette f r the xpression and regulation of bovine somatotropin in 



20 



WO 91/13993 



-6- 



PCT/US9i/00887 



15 



seed storage b dt s. 

In an ther mbodiment f the lnv nti nth g ne Is from brazil 
nut 2S storage protein r subunlts thereof, the promoter and tran- 
slation Initiation signal are derived from the same genetic source, 
and the translation termination signal and poly-A region are. from 
phased In or fi-zein 15Kd. 

The invention also provides for cells, plants, and seeds 
containing the expression cassettes disclosed here. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides a plant in which a gene coding 
for a specified protein is Incorporated Into the plant's genome in a 
manner such that regulation and expression of the specified protein 
is under the control of a seed specific regulatory sequence. The 
protein which Is of Interest and for which expression Is desired may 
be a plant protein homologous or heterologous to the host plant, or 
on the other hand, may be normally produced In a virus, micro- 
organism, or other animal tissue. 

The following examples illustrate specific embodiments of the 
invention. We have used a modified form of the polymerase chain 
reaction technique to combine various seed storage protein gene 
regulatory elements (promoters, translation Initiation sites, 
translation termination sites, and transcription polyadenylation 
(poly-A) signals) with protein coding regions to form expression 
cassettes which direct and control the production of various modified 
proteins In the seeds of plants. These expression cassettes may then 
be used to transform plant cells from which whole plants are derived. 
Gene regulatory elements are used which may be inserted into hosts of 
either mohocot or dicot plant species. Of particular interest as 
host plants are those which are cultivated for high levels of seed 
30 production. Such hosts include wheat, corn. rice. etc. (monocots) 
and soybean, common bean, pea, etc. (dicots). Corn and soybean 
represent ideal hosts and seed storage protein regulatory regions 
isolated from Phaseolus vulgaris (phaseolin gene). Glycine max 
(promoter from the a'-subunit of /J-conglycinin gene) and Zea mays 
(zein 15 Kd peptide gene) are preferred as sources of regulatory 
regions. Proteins which would hav value when expressed in plant 
seeds include those designed for nutritional improv m nt (e.g.. high 
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sulfur soyb an and c mmon bean, high lysine corn, etc.), Pharm- 
acol glcal lap rtanc (enzymes, and horn nes) , and Industrial value 
(enzymes). We have demonstrated seed specific expression cassettes 
containing genes designed to Improve the nutritional value of seed 
protein meal by encoding a high-sulfur gene. We have also demonstra- 
ted the production and segregation of bovine somatotropin (Bst) 
protein Into protein storage vacuoles. 

Numerous cloning vectors are available which have an appropriate 
restriction site, a replication system, and a marker. As a cloning 
vector we have selected pUC18, Norrander et al, Gene (1983)26:101- 
106, containing an ample ill in resistance marker. Other suitable 
vectors are well known and may be easily adapted for use with the 
specific restriction sites of the present expression cassettes. 

For the genetic engineering experiments described in the 
15 examples one can use many different methods of obtaining the desired 
sequence junctions between regulatory regions and coding DNA, such 
as, site specific mutagenesis, linkers, and/or polymerase chain 
reactions (PCR). We have selected to use a modification of PCR, 
which we call custom polymerase chain reaction (CPCR) as the techni- 
que for the addition of regulatory regions with coding DNA. CPCR 
allows one to incorporate specific enzyme restriction sites directly 
into the linkage oligomers as they are synthesized. The oligomers 
are then used in PCR amplification of a desired gene or regulatory 
element. The advantage to this modification Is a more rapid con- 
struction of expression cassettes from the amplified sequences. 

We include in the following examples expression cassettes for 
transcriptional and translational fusion; two dicot specific seed 
promoter cassettes; and one monocot specific seed promoter cassette. 
Those skilled in the art will recognize that these cassettes may be 
utilized in a variety of hosts. In addition, the incorporation of 
these cassettes into a host plant genome may be accomplished by a 
variety of conventional methods. Such methods include transformation 
with Ti-DNA (A^ tumefaciens, rhlzoeenes . etc.). protoplast fusion, 
injection, electroporation, and microproj ectile bombardment. These 
plant seed expression cassettes are used to express a high sulfur 
seed storage protein gene in the seeds of both dicot and monocot 
plants and to express a non-plant gene, bovine somatotropin (Bst), in 
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the seeds f dicot plants. 

Transgenic plants c ntalnlng th h terol gous gen of interest 
axe obtained through the Rl generati „. Seeds from these plants. R2 
seeds, are stained with protein antibody for TEM visualization of the 
5 heterologous protein localized within the protein storage bodies.. 

The following abbreviations are used throughout this disclosure 
to denote gene regulatory elements: 
P - Promoter 

SP - Translation Initiation Signal or Signal Peptide 
10 11 " Translation Termination Signal 

S - Polyadenylation (Poly-A) Signal 
The following abbreviations are used throughout this disclosure 
to indicate source of the gene regulatory elements: 
bv - genus Bovine 

15 hz " Bertholletia excelsia (Brazil nut) 

gm - Glycine max 
pv - Phaseolus vulgaris 
zm - Zea mays 

The following abbreviations are used throughout this disclosure 
to indicate proteins: 



20 



•Bsf means bovine somatotropin protein. "BST" means the 
gene encoding Bst. 

•Bz" means a sulfur-rich protein isolated from Brazil nut 
seed. "Bz-1". Bz-2". and »Bz-3- are subunits of Bz. These proteins 
25 are described in further detail below. "BZ", "BZ-1", "BZ-2". and -BZ- 
3" are the genes encoding the respective proteins. 

The following terms are used throughout this disclosure and are 
defined as follows: 

"Seed specific expression cassette" means a DNA construct 
containing gene regulatory elements and a gene encoding a protein of 
interest, in a suitable vector for incorporation into a plant genome 
such that the expression of the gene is under the control of the gene 
regulatory regions, the regions being of the specificity, orienta- 
tion. and reading frame such that expression of the gene is in the 
35 plant seed. 

"Expr ssion cassette" means a DNA construct lacking at least a 
gene encoding a pr tein of interest. 
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"Oligomer primer" sans a nucleotide sequence needed for PCR 
amplification of a gene or gene regulatory element. The prim rs used 
in this invent! n hav been synthesized to incorporate specific 
restrict! n enzyme sites into th primer. 

Charts 1 through 22 illustrate the constructions of the present 
invention. The following conventions are used to illustrate plasmids 
and DM fragments: 

(1) The single line figures represent both circular and linear 
double-stranded DNA. 

(2) Asterisks (*) indicate that the molecule represented is 
circular. Lack of an asterisk indicates the molecule is linear. 

(3) Endonuclease restriction sites are indicated above the line. 

(4) Genes are Indicated below the line. 

(5) Distance between genes and restriction sites are not to 
15 scale. The charts show their respective positions only. 

The methods used here employ many standard recombinant DNA 
procedures as well as those for creating transgenic plants and are 
well known by those skilled in the art. Standard methods and laborat- 
ory techniques for recombinant experimentation can be found in T. 
20 Maniatis et al.. -Molecular Cloning; A Laboratory Manual-, Cold 
Spring Harbor Laboratory (1982) (recombinant DNA experimental protoc- 
ols and methods); Miller. -Experiments in Holecular Genetics-. Cold 
Spring Harbor Laboratory (1972) (culture techniques); Southern. J. 
Mol. Biol. 2fi: 503 (1975) (clone verification); Sanger, et al. Proc. 
Natl. Acad. Sci. USA 74:5463 ( i 977 ) (DNA sequencing). S.B. Gelvin and 
R.A. Schilperoot. -Plant Molecular Biology- (1988). supplies techni- 
ques for use in transforming and cloning plants. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

EXAMPLE 1 

30 CONSTRUCTION OF AN EXPRESSION CASSETTE CONTAINING THE 

PHASEOLIN PROMOTER (pvP) AND POLYADENYLATION SIGNAL (pvS) 

FOR TRANSCRIPTION FUSION TO CODING DNA 
The major seed storage protein gene of Phaseolus vulgaris 
(common bean) is known as phaseolin. The complete sequence, includ- 
35 ing 5' and 3' regulatory elements, and cDNA counterpart have been 
published by Slightom et al. (1983). As a source of phase lin 
regulatory regions we use the clone pPhasmin which has the intronless 
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cDNA counterpart in plac of the genomic coding and Intron regions, 
as described by Cramer t al. , Pre. Natl. Acad. USA, 82:334-338 
(1985) . Oligomer prim rs for amplification of phaseolin promoter 
(pvP) are constructed to contain Hindlll (in the 5' primer) and Ncol 
(in the 3' primer) sites. The JITcol site is designed to include a 
translation initiation site which conforms to the rules described by 
Kozak (1986) by containing the sequence AAXXATGG where X represents 
G. A, t or C. Sequences o'f the oligomers used for the amplification 
and engineering of the phaseolin promoter are shown in Chart 1(a). 
Using CPCR technology the restriction sites are fused to the 500 b.p. 
phaseolin promoter region resulting in the fragment designated pvP in 
Chart 1(b). 

In a similar maimer oligomer primers are synthesized to flank 
the phaseolin polyadenylation (poly- A) signal. In this case, however, 
15 the 5' primer Incorporates a Ncol site and the 3' primer a HindHI 
site. The nucleotide sequences of these two primers Is shown in Chart 
1(c). Amplification results in a 1100 b.p. fragment designated pvS in 
Chart 1(d). The pvP and pvS fragments were subjected to digestion by 
Ncol followed by ligation of the fragments (Chart 1(e)). The ligated 
pvP + pvS fragment (1600 b.p.) is subjected to fflndlll digestion, 
purified on polyacrylamide gel. and cloned into ffindlll digested 
PUC18. The clone containing the correct pvP + pvS sequences (con- 
firmed by restriction enzyme mapping and nucleotide sequencing) is 
referred to as pUClSpvPpvS in Chart 1(f). This expression cassette 
is used with coding regions which either contain their own transla- 
tion initiation signal or do not need a signal for proper targeting 
of the protein product. in addition, any coding regions Inserted 
into this cassette must include a translation termination signal. 

EXAMPLE 2 

CONSTRUCTION OF A PHASEOLIN EXPRESSION CASSETTE CONTAINING 
PHASEOLIN PR0M0TOR (pvP). SIGNAL PEPTIDE (pvSP) , 
TRANSLATION TERMINATOR (pvTT) , AND P0LY-A SIGNAL (pvS) 
FOR TRANSLATIONAL FUSIONS 
In this example oligonucleotide primers are designed for the 
amplification of promoter and terminator fragments from pPhasmin 
which contain th phaseolin signal peptide (SP) and translational 
termination (TT) signals, respectively. N-terminal analyses of 
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phaseolin prot in by Parran et al., Phytochemistry 26: 335 (1987), 
show that cleavage Is ragged and ccurs after amln acid resldu 23 
and before amino acid 27. Thus, to ensure that the cass tte contains 
the amino acids necessary for signal peptide cleavage the cassette is 
5 designed to include through residue 31. For CPCR amplification of 
the pvPpvSP fragment oligomers are designed which contain an Inframe 
5' Hindi 1 1 site and an inframe 3' Mcol site; these sequences are 
shown in Chart 2(a). Amplification yields the 580 b.p. fragment 
designated pvPpvSP in Chart 2(b). 

10 To ensure complete reading of the translation termination 

sequence, the 4 amino acids 5' of the translation termination codon 
are retained. These phaseolin 3' regulatory regions are obtained 
using the primers shown in Chart 2(c). Again, these primers contain 
an inframe Nco I site in the 5' oligomer and a JTindlll site In the 3' 

15 oligomer. Amplification yields a 1200 b.p. fragment referred to as 
pvTTpvS in chart 2(d). 

Both fragments, pvPpvSP and pvTTpvS, are subjected to digestion 
by Ncol and the exposed ITcoI sites are ligated together to yield the 
fragment referred to as pvPpvSPpvTTpvS in chart 2(e). This fragment 

20 is then digested with JTindlll to expose CPCR engineered 5' and 3' 
sites followed by cloning into ffindlll digested pUC18. The phaseolin 
expression cassette referred to as pUC18pvPpvSPpvTTpvS In chart 2(f) 
is obtained and checked by restriction enzyme mapping and nucleotide 
sequencing to determine that it is correct. 

25 .This seed specific expression cassette is used with coding 

regions which need a seed specific signal peptide and an inframe 
translation termination region. 

EXAMPLE 3 

CONSTRUCTION OF AN EXPRESSION CASSETTE CONTAINING 
30 THE PROMOTER REGION OF THE GLYCINE MAX SEED STORAGE PROTEIN 

a'-SUBUNIT OF 0-CONGLYCININ (gmP) AND PHASEOLIN POLY-A SIGNAL (pvS) 
In soybean (Glycine sp.) ^-conglycinin represents the major seed 
storage protein. The level of expression is very high; each of the 
two to four subunits account for between 12 to 20% of the total 
35 protein in the seed. The nucleotide sequence of the promoter region 
of Glycine max a'-subunit of /?-conglycinin has been published by 
Doyle t al., J. Biol. Chem. 261:9228 (1986). 
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The promoter (gmP) is amplified fr m 250 tig total Glycine max 
DNA and, by using CPCR, a JUndlll site is added to the 5' end as well 
as an Ncol site at the translation initiator codon (ATG) . Using the 
published sequence oligomers are designed for amplification of the 
promoter (oligomer sequences are shown in Chart 3(a)). The sequence 
of the amplified 600 b.p. gene, designated gmP in Chart 3(b), is 
verified by restriction enzyme mapping and nucleotide sequencing for 
accuracy. 

The fragment gmP was subjected to digestion by Ncol and ligated 
to the HcoZ digested fragment pvS (described in Example 1) to obtain 
the fragment referred to as gmPpvS in Chart 3(c). The resulting 
fragment, gmPpvS. is digested with tfindlll. ligated Into ffindlll 
digested pUC18. and the clone pUClSgmPpvS is obtained. (See Chart 
3(d)). This expression cassette is used to direct seed specific 
expression of coding regions which require transcriptional fusions, 
similar to that shown in Example 1. However, when incorporated into 
the Glycine, genome the level of expression may be somewhat higher 
than the Bxaseolus expression cassette of Example 1, due to the more 
native environment for the Glycine cassette. 

EXAMPLE 4 

CONSTRUCTION OF AN EXPRESSION CASSETTE CONTAINING THE 
GLYCINE MAX a'-SUBONIT OF /9-CONGLYC1NIN PROMOTER (gmP) AND 
SIGNAL PEPTIDE (gmSP) REGIONS AND THE PHASEOLIN TRANSLATION 
TERMINATION (pvTT) SIGNAL AND POLYADENYLATION (pvS) SIGNALS. 
In this example we describe construction of an expression 
cassette and plasmid which is used for the expression of modified 
seed storage protein genes or heterologous genes in the seeds of 
dicotyledonous plants. Oligomers are used to amplify the promoter and 
signal peptide region from Glycine max genomic DNA. For the oligomer 
which is read 5' - 3' the same 5' primer as described in Example 3 is 
used. To ensure complete reading of the signal peptide, the 3' 
oligomer primer is designed to hybridize just beyond the signal 
peptide cleavage point described by Doyle et al, (1985), which occurs 
between amino acid residues 25 and 26. Therefore, we have selected 
35 to add the tfcol site after amino acid residue 29. The primers used 
for this amplification are shown in Chart 4(a) . 

The result of this amplification,- referred to as gmPgmSP in 
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Chart 4(b), is a fragment f 690 b.p. Authenticity of this fragment 
is checked by nucl otide sequencing. It is then subjected to 
digestion by Ncol and ligated to the Ncol cut fragment pvTTpvS 
described in Example 2. The resulting fragment, referred to as 
5 gmPgmSPpvTTpvS in Chart 4(c), was subjected to digestion by Jfindlll 
and cloned into flindlll digested pUC18. The clone (designated 
pUC18gmPgmSPpvTTpvS in Chart 4(d)) is isolated and its structure 
checked by restriction enzyme mapping and nucleotide sequencing. 
This expression cassette is used for the expression of modified seed 
10 storage protein genes or foreign genes in the seeds of dicotyledonous 
plants . 

EXAMPLE 5 

CONSTRUCTION OF AN EXPRESSION CASSETTE CONTAINING 
THE PROMOTER (zraP) AND POLYADENYLATION SIGNAL (zmS) FROM 
15 THE ZEA MAYS 15 kDAL SEED STORAGE PROTEIN GENE. 

The zein proteins constitute about 80% of the total seed protein 
of Zea mays (corn); however, there are many zein-related genes 
responsible for the production of these zein proteins. Recently, one 
particular member of the zein gene families was found to be represen- 
20 ted in low copy number, but responsible for the production of about 
10% of the total zein proteins. These genes are referred as /Nzeirts 
(or 15 Kd) genes and the sequence of one member has been described by 
Pedersen et al. (1986). However, the nucleotide sequence of the 0- 
zein gene reported by Pedersen only extends about 233 b.p. 5' of the 
25 translation initiation, thus the complete £-zein gene promoter may 
not be present. To obtain more of the 0-zein promoter we Isolated a 
second gene, referred to as pZein!5R t and our nucleotide sequence 
analysis (Chart 5) shows that this clone contains an additional 110 
b.p. of 5' flanking DNA, giving a total promoter region of about 350 
30 b.p. to the translation initiation codon (see Chart 5). Analysis of 
many zein type genes has shown that sequences responsible for con- 
trolling the developmental expression of seed specific genes are 
located within these 350 b.p. (see review by Thompson and Lark ins ( 
BioEssays, 10:109-113 1989). 
35 Clone pZeinl5K (Chart 6(a)) provides the starting material for 

the CPCR amplification of its promoter and polyadenylation regi ns. 
How ver, prior to amplification an indigenous Ncol site in the 
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pr moter regi n was removed by flushing with mung bean nuclease 
f 11 wing the procedure f Manlatis, et al, (1982). which caused a 
loss of four base pairs (CATC) . These four base pairs are located 
between the CCAAAT and TATAA elements and thus should not affect the 
function of the promoter. This modified 0-zein gene promoter region 
was amplified using CPCR to add a Sail site and a ffcol site to the 5' 
and 3' primers, respectively. The sequences of the oligomers used to 
accomplish this are shown in Chart 6(b). The amplified product was a 
350 b.p. fragment which is referred to as zmP in Chart 6(c). Addi- 
tional CPCR oligomer primers were used to amplify the polyadenylation 
region of pZein 15K so as to add a (5') Hfcol site and a (3') Sail 
site. Oligomers sequences are shown in Chart 6(d). The amplified 
product was a fragment of approximately 700 b.p. which is referred to 
as zmS in Chart 6(e). The two amplified fragments. zmP and zmS. are 
subjected to digestion with Wcol followed by ligation and isolation 
of the fragment referred to as zmPzmS in Chart 6(f). zmPzmS is 
digested with Sail and cloned into Sail digested pUC18. Clone 
pUClSzmPzmS (Chart 6(g)) is isolated and its structure verified using 
restriction enzyme mapping and nucleotide sequencing. The expression 
cassette puClSzmPzmS is used for seed specific expression of modified 
seed storage protein genes or other genes which do not require a zein 
signal peptide and/or a zein translation terminator. 

EXAMPLE 6 

CONSTRUCTION OF A M0N0C0T SEED SPECIFIC EXPRESSION CASSETTE 
CONTAINING THE 0-ZEIN PROMOTER (zmP) , SIGNAL PEPTIDE (zmSP) . 
TRANSLATION TERMINATOR (zmTT) AND POLYADENYLATION SIGNAL (zmS). 
This expression cassette Is similar to that in Example 5, with 
the addition of the 0-zein signal peptide and translation terminator 
signals. As in Example 5. CPCR adds a 5' Sail site and a 3' inframe 
30 Naol site. Since in this case, however, the signal peptide cleavage 
point is located between amino acid residues 20 and 21, we have 
included amino acid sequences up to and including residue 25. Thus, 
the Ncol site is placed 3' of this residue. The oligomer primers 
used to engineer and amplify this zein promoter and signal peptide 
fragment are shown in Chart 7(a). This amplification results in a 
400 b.p. fragment referred to as zmPzmSP in chart 7(b). 

In a similar manner. CPCR amplification of the /J-zein transla- 
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tion t rminator and polyadenylation signal regions f pZein 15K is 
done using a linker engineered to contain a 5' inframe Ncol site and 
a 3' Sail site; sequences are shown In Chart 7(c). Amplification 
results in a 720 b.p. fragment referred to as zmTTzmS in Chart 7(d). 
The fragments zmPzmSP and zmTTzmS are subjected to digestion by Ncol 
and ligated together to obtain the fragment referred to 
zmPzmSPzmTTzmS in Chart 7(e). This fragment is then subjected to 
digestion by Sail and cloned into Sail digested pUC18. The resulting 
clone, pUClSzmPzmSPzmTTzmS (Chart 7(f)), is confirmed by restriction 
enzyme site mapping and nucleotide sequencing. This seed specific 
expression cassette pUCzmPzmSpzmTTzmS is used to regulate the 
expression of modified seed storage protein genes or other genes 
which may require a monocot or /J-zein signal peptide and inframe 
translation terminator signal. 
15 EXAMPLE 7 

PHASEOLUS VULGARIS, GLYCINE MAX, AND ZEA HAYS 
SEED SPECIFIC EXPRESSION CASSETTES FOR THE EXPRESSION OF 
BRAZIL NUT 2S SULFUR-RICH SEED STORAGE PROTEIN GENE. 
This example Illustrates the use of the seed specific expression 
cassettes designed in Examples 1. 3, and 5 for the expression of a 
sulfur-rich (26% methionine + cystine) seed storage protein isolated 
from Brazil nut. The Isolation and nucleotide sequence of this 
Brazil nut gene has been reported by Altenbach et al., Plant Mol. 
Biol. 8:239-250 (1987). and the cDNA clone referred to as pHS-3 
contains all but 25 b.p. of the 5 ' -untranslated region, all of the 
coding DNA and the complete 3' -untranslated region, as Illustrated in 
Chart 8. Because the gene encodes a protein which is normally found 
in a plant seed environment it Is believed that its stable expression 
and proper transportation to seed storage protein bodied are charac- 
teristics which are inherent to its structure. However, an analysis 
of the sequence reveals a second Ncol site located near the transla- 
tion termination codon. Thus for the ease of engineering we elimina- 
ted this Ncol site by using CPCR to replace the native Ncol site with 
an inframe *cbl site. This results in the second G being replaced 
35 with a C nucleotide and removes the internal Ncol site. The JVcol 
site added via CPCR is used to clone this regi n into an expression 
cassette which has an inframe translation termination codon. For 
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this construction we use th propter regions described in Examples 
1. 3. and 5 and translation terminati n and polyadenylation signals 
described in Examples 2, 4, and 6. 

CPCR amplification of the Brazil nut gene added was used to 
5 engineer flanking Ncol sites; the 5' *col site is added to take 
advantage of the naturally occurring sites located at the Brazil „ ut 
translation! initiation cbdon (ATG) , and the second *coI site is 
reeved and replaced by a Hcol site as described above. A further 
advantage of this modification is the addition of the amino acid 
tryptophan to the gene. Tryptophane is also deficient i„ the 
proteins of Bany plant seeds thus lt> ^ & ^ 

nutritionally collate seed protein. The primers for the amplifica- 
tion of the Brazil nut coding region are shown in Chart 9(a>. 
Amplification yields a 420 b.p. fragment which is referred to as B2 
15 n Chart 9(b). This fragment is subjected to digestion with *eoI and 
inserted into the expression cassettes described earlier. The 
following clones are isolated, pUClSpvPBZpvTTpvS . pOClSgmPBZpvTTpvS 
and pUClBzmPBZzmTTzmS, and their structures are checked using 

20 rr Ctl °, ^ maPPil * ^ nUCle ° tlde -quencing. The structure 
these clones are shown in Charts 9(c) (i), ^ ^ 

cassettes may then be removed and placed into an appropriate vector 

e LrT " T PlantS ' ^ — ' — ^ ^ used in 
either the Agrobacterium systems or in systems which utilize micro- 
projectile bombardment of plant tissues. 
25 EXAMPLE 8 

USB OF PHASEOWS VULGARIS, GLYCINE MAX, AND ZEA MAYS 
SEED SPECIFIC EXPRESSION CASSETTES FOR THE EXPRESSION OF A 
SYNTHETIC SULFUR -RICH SEED STORAGE PROTEIN GENE 

30 acid SCedS fr ° m ^ ^ d6fiCient in th - « -*» 

acid and because no single seed storage protein gene has been found 

Us ml ° f *~ " 18 — ^ these 

« * wil be accomplished via a synthetic coding region. To ill u . 

ha t^ POSSiMlity ^ ° lig ° ffierS ^ < Sh - 
Chart 10) representing portions of the Brazil nut coding DNA. As 

IhH .IT 10(a) ' ^ fiKt ° Ug0ffler e — *• -jor portion of 
h 9 kD subun. (BZ-1), which contains 35 % sulfur amino acids for 
48 amin acid peptide. Chart 10(b) shows a second oligomer con- 
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sistlng f the compl te 9 kD subunit (BZ-2) which encodes 70 amino 
acids containing 28.5% sulfur amino acids. The third oligomer. Chart 
10(c), is the complete 12 kD subunlt (BZ-3) which encodes 93 amino 
acids containing 28% sulfur amino acids. BZ-1 was synthesized as a 
single 144 base oligomer, BZ-2 as a 210 base oligomer and BZ-3 as a 
279 base oligomer. These oligomers were synthesized by utilizing the 
published sequence for the BZ gene and were completed on an Applied 
Blosystems, Inc. Model 380A DM Synthesizer. The brazllnut gene 
fragments were complemented, purified, and had Wool sites added on 
both 5' and 3' ends using CPCR amplification. As Is true of the BZ 
gene In Example 7, each lnframe Wool site yields the codon TGG which 
encodes tryptophan, making resulting plant seed a more nutritionally 
complete foodstuff. 

The primers used for amplification of BZ-1 are shown In Chart 
15 11(a), those for BZ-2 In Chart 11(b), and those for BZ-3 in Chart 
11(c). Because of the addition of the extra nucleotides the frag- 
ments isolated after amplification are about 15 b. p. larger than the 
original synthesized fragments BZ-1, BZ-2, and BZ-3. These fragments 
are subjected to digestion with ffcol and cloned into the Ncol cut 
expression cassettes described in Examples 2, 4 and 6. The following 
clones are isolated. pUC18pvPpvSPBZ-I. -2, and -3pvTTpvS (Charts 
12(a)-(c)), pUCl8gmPgmSPBZ-i, -2. and OpvTTpvS (Charts 13(a)-(c)), 
pUClSzmPzmSPBZ-l. -2, and -JJpvTTpvS (Charts 14(a)-(c)). The struc- 
ture of these clones is checked by restriction enzyme mapping and 
25 nucleotide sequencing. The expression cassettes contained within 
these plasmids may removed and transferred into vectors used for 
Agrobacterium systems or may used with microprojectile bombardment of 
plant tissues. 

EXAMPLE 9 

30 SEED SPECIFIC EXPRESSION CASSETTES FOR 

THE EXPRESSION OF A FOREIGN GENE. 
In this example we have selected to use the seed expression 
cassettes described above for the engineering of vectors capable of 
expressing the bovine somatotropin gene (Bst); this gene is also 
35 referred to as the bovine growth hormone (Bgh) gene. The sequence of 
the Bst gene has been d termined by Woychik et al., Nucleic Acids 
Research, 10:7197-7210 (1982), however, this s quence is not accep- 
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table for expression in a plant system due to the presence of 4 
introns in the native gene (Barta et al. 1986). To avoid this 
problem we use an intronless (minigene) construction of the Bst gene 
obtained from Professor Fritz Rottman, Department of Biology, and 
Microbiology, Case Western Reserve University, Cleveland OH. This 
clone is referred to as pSVGH-1 (delta IVS) and is used to provide 
the Bst coding region for' CPCR engineering. As is clear from the 
prior examples, the use of CPCR allows the coding region to be en- 
gineered in such a manner to be accepted by any of the expression 
cassettes described in Examples 1 to 6. These arrangements can 
include the use of different gene regulatory regions which can be 
provided by the expression cassette or even from the Bst gene. 

Bst protein is encoded in a gene which contains a 26 amino acid 
signal peptide followed by a coding region containing 191 amino 
15 acids. The clone pSVGH-1 also contains the indigenous Bst gene 
polyadenylation signal. For the Bst protein to be active its signal 
peptide must be removed, thus by using CPCR in combination with the 
cassettes described above we have the option of expressing it with or 
without its signal peptide, with or without a seed protein signal 
peptide, and with or without its polyadenylation signal. Thus many 
different construction can be derived for the expression of Bst in 
plant seeds. This example illustrates several of these expression 
cassettes coding for Bst. 

The first series of constructions uses the seed storage protein 
gene expression promoters, described in Examples 1, 3 and 5, to 
construct transcriptional fusions genes which use the Bst signal 
peptide and polyadenylation signal. Using the clone pSVGH (delta 
IVS) as starting material. CPCR is used to add an Ncol site to its 5' 
end (retaining the translation initiation signal) and adding a 
30 WadHl-.it* 3' of the Bst polyadenylation signal. The oligomer 
primers of this construction are shown in Chart 15(a) and are used to 
amplify a 840 b.p. fragment referred to as bvSPBSTbvTTbvS . This 
fragment is subjected to digestion by Ncol and cloned into the seed 
promoter fragments described in Examples 1 and 3, see Charts 15(b) 
35 and 16(a). These ligated seed promoter-Bst fragments are then 
digested with /findlll followed by ligation to HindlU digested P UC18 
and the f llowing clones isolated, pUClSpvPbvSPBSTbvTTbvS and 
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pUC18gniPbvSPSSrbvTTbvS, as shown in Charts 15(c) and 16(b), respec- 
tively. The correct sequence is verified using restriction enzyme 
site mapping and nucleotide sequencing. The inserts from these 
clones can be removed and transferred into either an Agzobacterium 
vector system or into a vector to be used for microproj ectile 
bombardment of plant tissues. 

To avoid the possibility of having an unstable mRNA product, the 
Bst gene can be cloned directly into the Ncol site located within the 
seed expression cassettes shown in Examples 1 and 3. To accomplish 
this only the 3* oligomer primer needs to be changed to include Ncol. 
This is done with the use of the oligomer primer shown in Chart 17(a) 
and CPCR engineering. The fragment containing the Bst gene obtained 
from this amplification is also about 840 b.p. in length, and is 
referred to as bvSPBSTbvTTbvS in Chart 17(b). This fragment is 
digested with Ncol and cloned into the Ncol site of the plant 
expression cassette shown in Example 1. From this ligation, a clone 
designated pUC18pvPbvSPBSTbvTTbvSpvS is obtained and is shown in 
Charts 17(c). In a similar manner, the expression cassette of 
Example 3 is inserted into bvSPBSTbvTTbvS (Chart 18(a)) resulting in 
20 the plasmld pUC18gmPbvSPBSTbvTTbvSpvS of Chart 18(b). 

Additionally, by the use of another 3' oligomer primer a similar 
construction is derived which does not include the Bst polyadenyla- 
tion signal. The 3» primer used to obtain the Bst such a fragment 
without its polyadenylation signal is shown In Chart 19(a). Amplifi- 
25 cation yields a fragment of about 650 b.p., referred to as 
pvSPBstbvTT in Chart 19(b). This is subjected to digestion by Ncol 
and cloned into the plant expression vector shown in Example 1, 
resulting in clone pUC18pvPbvSPBSTbvTTpvS (Chart 19(c)). In addition, 
this Bst fragment (Chart 20(a)) is ligated into the vector of Example 
3 which results in clone pUC18gmPbvSPBSTbvTTpvS , shown in Chart 
20(b) . 

EXAMPLE 10 
SEED SPECIFIC EXPRESSION CASSETTES FOR 
THE EXPRESSION OF A FOREIGN GENE. 
In its active form, Bst does not include its 19 amino acid 
signal peptide, which is cleaved soon after synthesis (W ychik et 
al., 1982). However, in a plant system it is doubtful that this 
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signal peptide would be removed, thus Its removal would have to b 
d ne as part of a purification pr cedure. In addltl n, th above 
constructions do not include the normal signal peptides >f or the plant 
seed storage protein genes to ensure correct targeting of the Bst 
peptide. The following constructions illustrate a translational 
fusion, made with the Bst protein gene, which does not include the 
Bst signal peptide but does contain the signal peptides from the 
plant seed storage protein genes. This is accomplished by using CPCR 
to amplify the part of the Bst gene which does not include the Bst 
signal peptide and cloning it into the expression cassettes described 
in Examples 2 and 4. The polypeptide resulting from expression from 
these fused gene is expected to initially include the seed storage 
signal peptides, which should be removed as a result of normal 
proteolytic cleavage to yield a Bst polypeptide which would be 
15 identical to the naturally occurring peptide except for the presence 
of two to four additional amino acids at its N- terminus. These 
additional amino acids are needed to ensure that the signal peptide 
cleavage site is recognized (see Examples 2, 4 and 6). 

The appropriate Bst gene fragment can be obtained using CPCR 
20 amplification using the two primers shown in Chart 21(a) which 
amplifies a 600 b.p. fragment referred to as BSTbvTT in Charts 21(b) 
and 22(a). This fragment is subjected to digestion by Ncol and then 
ligated into the seed storage protein gene cassettes described in 
Examples 2 and 4 to obtain clones, pUC18pvPpvSPBSTbvTTpvS (Chart 
25 21(c)) and pUC18gmPgmSPBSTbvTTpvS (Chart 22(b)). The engineered 
expression cassettes contained within these clones can be removed by 
digestion with flindlll followed by transfer to vectors designed to be 
used with ^robacterium or microprojectile gene transfer systems. 

EXAMPLE 11 

30 PLANT CONTAINING A CASSETTE FOR EXPRESSION OF BST 

A plant was produced which contained an expression cassette 
coding for the bovine somatotropin gene, BST. We began by removing 
the regulatory region of the phaseolin gene from clone pPhas 8.8 
Bglll, J.L.Slightom, et al, Proc. Natl. Acad. Sci. U.S.A. 80:1897- 

35 1901 (1983) , by restriction endonucleas digestion with Bglll and the 
fragment purified following standardized t chniques. Next, we used 
the transformed SV40 plasmid, pSVbGH-1, (described earlier) contain- 
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ing a complementary DNA (cDNA) f bovine somatatropin (Bst) and a Bst 
polyadenylation region; these are fused downstream f the SV40 
promot r. Plasmid pSVbGH-1 contains a single restriction site for 
BamHI which lies between the SV40 promoter and the Bst gene. Finally, 
5 pPhasBst is made by digestion of pSVbGH-1 with BamHI followed by 
subsequent incorporation of the purified promoter region at the 
restriction site. 

The region of the pPhasBst plasmid containing the Bst genes and 
the phaseolin regulatory region is removed by digestion with Kpn l and 

10 inserted at the KonI site of Agrobacterium binary vector pGA482G to 
make plasmid pCA482G/PhasBst. pGA482G/PhasBst is used to transform 
Agrofractertura strain C582707. A.G. Hepburn, et al, J. Gen. Microbiol. 
131:2961-2969 (1985). Finally, the resultant Agrobacterium is used to 
infect cultured tobacco leaf pieces, from which whole plants were 

15 derived. 

Transgenic tobacco plants containing the FhasBst gene are 
obtained through the Rl generation. Seeds from theses plants, R2 
seeds, are stained with Bst antibody for TEM visualization. Results 
show localization of Bst protein within the protein storage bodies. 
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CHART 1 



Construction of pUC18pvPpS expression cassette 



Hindlll 
1 

5' -CCTAAGAAGCTT GGCACTCTGTGGTCTTTTGGTTC - 3 ' 
5' Oligomer primer 



Ncol 
> 

5 ' - GTCAAGCCCATGG TTAGTAGAGTAGTATTGAATATG - 3 ' 
3' Oligomer primer 



Hindlll Ncol 
i- L 

pvP 
500 b.p. 



Ncol 

5 ' -CAATTGCCATGG TTGTATGAACTAAAATGCATGTAG - 3 ' 
5' Oligomer primer 

Hindlll 

5 ' - CAACTGAAGCTT GTTGAAACATCCCTGAAGTGTC - 3 ' 
3' Oligomer primer 



Nco * Hindlll 
J L 

pvS 
1100 b.p. 
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CHART 1 (continue 



<e) 



Hindlll 



Ncol 



Hindlll 



10 



pvPpvS 
1600 b.p. 



15 



(f) 



Hindlll 
I 



pvp 



Ncol 
—i- 



-> pvs 



Hindlll 



pUC18pvPpvS 
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■ CHART I 

Construction of pUCl8pvPpvSPpvTTpvS 
Hindlll 

5 ' - CCTAAGAAGCTT GGCACTCTGTGGTCTTTTGGTTC - 3 * 
5' Oligomer primer 



Ncol 



5 ' - AATTAATCCATGG CCGGACTGAAGTGGCAAATG- 3 9 
3' Oligomer primer 



Hindlll NcoI 
1— L 

pvPpvSP 
580 b.p. 



Ncol 



5 ' -AATCTTCCATGG GCATTTGTGTACTGAATAAGTATG - 3 ' 
5' Oligomer primer 



Hindlll 



5 ' - CAACTGAAGCTT GTTGAAACATCCCTGAAGTGTC - 3 ' 
3' Oligomer primer 



Hindlll 

a 

pvTTpvS 
1200 b.p. 



Ncol 
1- 
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(e) 



Hindll I 
J 



CHART 2 (continued) 



Ncol 



pvPpvSPpvTTpvS 



Hindlll 
L 



15 



20 



(f) 



Hindll I 
I 



Ncol 



PYP > pvs > pvTT > pvs 

pUC18pvPpvSPpvTTpvS 



Hindlll 
I 
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CHART 3 



Construction of pUClBgmPpvS 



Htndlll 
I 



5 1 - T ACATGAAGCTT GATCCATGCCCTTCATTTGCCG- 3 ' 
5' Oligmer primer 

Ncol 

5 9 -TCAAGCCATGG TTAGTATATCTTAAATTCTTTAATAC - 3 ' 
3-* Oligomer primer 



HindHI Ncol 
J _l 



gnP 
600 b.p. 



HindHI Ncol HindHI 
J 1 L 

gaPpvS 
1700 b.p. 



HindHI Ncol HindHI 

L- 1 L_* 

gPLr > pvS - > 

pUC18gmPpvS 
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« 

Construction f pUC18gmPgmSPpvTTpvS 
Hindlll 

5 ' -TACATGAAGtTT GATCCATGCCCTTCATTTGCCG - 3 ' 
5' -Oligmer 

Ncol 
1 

5 ' -TTTAAATCCATGG TTCCCAATACGCAATGCCAAATGAG - 3 ' 
3' Oligmer 

Hindlll Ncol 
J— L 

gmPgmSP 
690 b.p. 

Hindlll 

— L 



Hindlll Ncol Hindlll 

* 1 L - 1 * 

gmp > gmSP > pvTT > pvs > 

pUC18gmPgmSPpvTTpvS 



Hindlll Hcol 
J U 



gmPgmSPpvTTpvS 
1890 b.p. 
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CHART 5 

Nucleotide Sequence of pZeinl5k 
GGATCCAAGGCACCTAACAACTAGCACAACATTACAACAAAATATTGGTGTATATGTGCC 



TACAATAAGGTGATGAGTCATGGTGATGTGTAAAGAGGCATTACAAAGTTAGCTTCACAA 

"■I- 1 ...... .4.. j 

+ + 120 



+ + + +- 



15 

GCXTrATGAATTCATTGACAACCCTTGACATGTAAAGTTGATTCATATGTATAAGAAAGCT 



20 TAATGATCTATCTGTACATCCAAATCCATGTACTATGTTTCCACgTCATGCAACGCAACA 

"-+"- — —+ + + + + 24Q 



25 



300 



TTCCAAAACCATGGATCATCTATAAAT6CCTAGCTCCCACATATGAACTAGTCTCTATCA 
" " + + + +---- + + 

TCATCCAATCCAGATCAGCAAAGCGGCAGTGCGTAGAGAGGATCGTCGAACAGAACAGCA 

30 + + + + + 350 

M 

361 TGAAGA ^ CATCOTCT ^«nrac 

k VYY VTT T'c'l* TTTVT's'TTT'r 420 

421 ^^^^f^f^ 

4o q 77V77t"rrrG"L"TV77frr; + 480 



481 



CGACGATGATGGGCGCCGGCGGGCTGTACCCCTACGrcGA(?IACCTGAGGCAGCCGCAGT 



■+ + + 



■+ + + 540 



45 TM MGAGGLYPY AEYLRQPQc 

541 ^t^???"?? 0 ^ 

so S P L ' a " a "a"^V y"a + 'g"c"g"V"t"s"1" m"y"q + 600 

601 ^f!^^^ 

55 P L R Q Q C*T"q"q"q + V"r"m" + M "d"V'J"s"'v"a + ^ 
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CHART 5 fconHp,,^) 

CGCaGCAGCTGCAGATGATGATGCAGCTTGAGCGTGCCGCTGCCGCCAGCAGCAGCCTGT 
5 661 h — --4- - . - _ _ „^ „-f-„__ _^ 720 

QQLQMMKQLERAA A A S S S L Y + 

ACGAGC ^ G CTCTGATGCAGCAGCAGCAGCAGCTGCTGGCACTCCAGGGTCTCAACCCCA 

10 721 + l. : j , , .,„„ 

_ _ . + + +-- + 780 

EPALMQQQQQLLALQGLNPM 

i * -,oi TCGCCATGATGATGGCGCAGAACATGCCGGCCATGGGTGGACTCTACCAGTACCAGTACC 

ID 781 *"'■■••*-*+-••»**--•+•••-•••••+........ -i- _^ 840 

AMMMAQNMPAMGGLYQYQ Y Q 

on o,-, AGCTCCCCAGC TACCGCACCAACCCCTGTGGCGTCTCCGCTGCCATTCCGCCCTACTACT 

841 ^ + ^ 900 

LPSYRTNPCGVSAAIPP Y Y * 
25 901 GA "^ TGATATrl ^ AAATCTCCTCT ^ 

GCAGTAAGACGACACACATTATCATGTGTGGTATGACCAATAATATATGCATCATAATAA 
30 + + + + + +1020 

1021 AGlmGGTmAAAGAA ^ TCGGAC GCTrGATATCTATGA^ 



35 



^ no, ^^^TATAAATTGTAAATATTTCAAAATCTCTATTTAGGCTCCAATGGAGAGCATATG 
1081 + + " + + + +1140 



40 GGTAGAGTAGTATATATGCTTGAAATACTAACAACTAGCAAAGTGCGGGCACGTTGCT/ 

+ + + + + +1200 



1141 



rAC 



ATGCTCATTTATGCTCGAGCATGGAGTATAAAACATAAAGATATATATGTTCTATTGGCT 

+ +1260 



45 1201 + + + + 



TGGTAAACGTGGATATAGGTTTAAAGCCAACAACTATGGTTCGAATCCTCATTTATAX 



1261 + + . 



AT 



mtctataa ™ a gcgtttttactatttaaatttto;gag^^ 

Ull • I I 

55 
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CHART 5 (continued) 



5 GACHTACTCnTGACAACAAATCAATTCGGAAGACACAACAGTAAAAGATAAAATTTCCGA 

1381 + + + + + +1440 



CGGCCAAATGCTTATGCTTCGACGACACCTATCCCTCGGCTCGTAATGGGTCGGAAATAA 
10 1441 + +--"- + + + ---+1500 



GGTTATTTCTGACGGCGGCCGTCGGAAATAGCTTATGTTCCGTACGGCCCGAAAGGGCCA 
1501 +- + + + + +1560 

15 

TCGATCATTTACCTAAGATAACGATCGTAACGGCCGATGGTGGGTT 
1561 + + + + + +1620 

20 

TCGACGCACTGCTAACCATCGATTATTCGATTCTATGTCTCCGAGGCTTATAGTATTGCG 
1621 + +--. + + + -+1680 

25 GGCAAACTTATCTGGCATTATGAAOT 

1681 + + + + + +1740 



30 



CATATAC 
1741 - 1747 
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CHART 6 
Construction of pUC18zmPzmS 

5 

(a) *_ 



10 



20 



45 



50 



III II 
P > SP > coding > TT > S > 



pZein 15k 



15 (b) Sail 



5' -ATAGCTGTCGAC GATCCAAGGCATCTAACAACT- 3 ' 
5' oligomer primer 



Ncol 



25 5 ' - GTCGTACCATGG TGTTCTGTTCGACGATCCTCTCTA* 3 ' 

3' oligomer primer 

30 

(c) Sail Ncol 



zmP 

35 350 b.p. 



(d) Ncol 
40 l 



5 1 - CATAGCCATGG TATTTGGGAAATCTCCTTC - 3 ' 
5' oligomer primer 

Sail 



5 ' - TAGACTGTCGAC AAGAACTGTGGAACCCACCATCG - 3 r 
3' oligomer primer 



55 
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CHART 6 fcontlnua^ nn) 



5 

(e) Ncol 



Sail 



zmS 

10 700 b.p. 



( f ) Sail Hcol 



Sail 



J —J 1 



zmPzmS 
1050 b.p. 



<S> Sail Hcol sail 

J I * 



* 



zmP > ... 

pUC18zmPzmS 
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CHART 7 



Construction of pUC18zmPzmSPzmTTzmS 



Sail 

5' -ATAGCTGTCGAC GATCCAAGGCATCTAACAACT-3' 
5' oligomer primer 



Ncol 

5' -AATTACCCATGG GCAGGGCATCTGCATTGCAGAG- 3 ' 
3' oligomer primer 



Sail Ncol 
J 1 

zmPzmSP 
400 b.p. 

Ncol 
_L_ 

5 ' - AATCATCCATGG C CCTACTACTGATTC ATGATATTTG - 3 ' 
5' oligomer primer 

Sail 
— 1_ 

5' -TAGACTGTCGAC AAGAACTGTGGAACCCACCATCG- 3 ' 
3' oligomer primer 

Ncol sail 



zmTTzmS 
720 b.p. 
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CHART 7 {continuation) 



10 



(e) Sail Ncoi SalI 
J — L L 

zmPzmS PzmTTzmS 
1120 b.p. 



(f) Sail NcoI SalI 

* 1 -J u 

15 zmP > zmSP > zmTT > zmS > 



pUC18zmPzmSPzmTTzmS 
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CHART 8 

Nucleotide Sequence of Brazil Nut 
5 Sulfur-Rich Seed Storage Protein 



TCCACCACTGCTCTGTATCACATATACCAACCCACGCTTATACCCAGAATCACCATGGCG 
10 1 + + + + + 60 

M A 

AAGATTTCAGTTGCGGCAGCAGCCCTCCTTGTCCTCATGGCCCTCGGCCACGCGACCGCC 

15 61 + + + + + + 120 

K1S VAAAALLVLMALGHATA 

TTCCGGGCCACCGTCACCACCACAGTGGTGGAGGAGGAGAACCAGGAGGAGTGTCGCGAG 
20 121 + + + + + + 180 

FRA TVTTTVVEEENQEECRE 

CAGATGGAGAGACAGCAGATGCTCAGCCACTGCCGGATGTACATGAGACAGCAGATGGAG 
25 181 + + + + + + 240 

QMQRQQ MLSHCRMYMRQQME 

GAGAGCCCGTACCAGACCATGCCCAGGCGGGGAATGGAGCCGCACATGAGCGAGTGCTGC 
30 241 + + + + + + 300 

ESP YQTMPRRGMEPHMSECC 

GAGGAGCTGGAGGGGATGGACGAGAGCTGCAGATGCGAAGGCTTAAGGATGATGATGATG 
3 5 301 + + + + + + 360 

EQL EGMDESCRCEGLRMMMM 

AGGATGCAACAGGAGGAGATGCAACCCCGAGGGGAGCAGATGCGAAGGATGATGAGGCTG 
40 361 +- + + + + + 420 

RMQ QEEMQPRGEQMRRMMRL 

GCCGAGAATATCCCTTCCCGCTGCAACCTCAGTCCCATGAGATGCCCCATGGGTGGCTCC 

* 5 «1 + + + + + + 480 

AE NIPSRCNLSPMRCPMGGS 

ATTGCCGGGTTCTGAATCTGCCACTAGCCAGTGCTGTAAATGTTAATAAGGCTCTCACAA 
50 481 + + + + + + 540 

I A G F * 
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CHART 8 (conttnuaM np - ) 



ACTAGCTCrrTTGTTGGCTTTTGGCCGGAGACTAGGGTGTGGGATTAATAATAATAGCACA 

+ + + + -+ 600 



CTATCGTGTGTTCTCAGCTTC 
10 601 -.+ + , 621 
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CHART 9 



Construction of Expression Cassettes Using the Brazil Nut 
Sulfur-Rich Seed Storage Protein Gene (BZ) 



< a > Ncol 



5 ' -AATTCCACCATGG CG AAG ATTT CAGTTG - 3 ' 
5' oligomer primer 



Ncol 

-L 



5 ' - AATTC AACCATGG CATGGGGGCATCTCATGGGACTG - 3 ' 
3' oligomer primer 



25 



30 



35 



50 



(b) Ncol NcoI 

J L 

BZ 
420 b.p. 



(c)(i) HindHI Ncol Ncol 
* 1 J L 



Hindi II 

* 



pvP > BZ > pvTT > pvS > 

pUC18pvPBZpvTTpvS 



*° (c)(ii) HindHI Ncol Ncol HindHI 

* 1 | I l" 

gmP > BZ > pvTT > pvS > 

A5 pUC18gmPBZpvTTpvS 



* 



(c)(iii) Sail Ncol - Ncol . HindHI 

*— L 1 L 1 * 

zmP > BZ > zmTT > zmS > 

pUC18zmPBZzmTT2mS 
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CHART 10 

(a) Nucleotide Sequence of a Synthetic 9 kD subunlt of a Brazil 
Nut Sulfur-Rich Seed Storage Protein (BZ-1) 

ATGGAGCCGCACAT6AGCGAGTGCTGCGAGCAGCTGGAGGGGATGGACGAGAGCTGCAGA 
HEP H MSECCEQLEGMDESCR 



15 TGCGAAGGCTTAAGGATGATGATGATGAGGATGCAACAGGAGGAGATGCAACCCCGAGGG 
C E G LRMMMMRMQ Q e'T'm" Y P Y"g + 

20 GAGCAGATGCGAAGGATGATGAGG 

"I + 144 

EQMRRMMR 



25 



30 



35 



(b) Nucleotide Sequence of a Synthetic Brazil Nut Sulfur- 
Rich Seed StorageProtein Complete 9 kD Gene (BZ-2) 



CCGAGGCGGGGAATGGAGCCGCACATGAGCGAGTGCTGCGAGCAGCTGGAGGGGATGGAC 
1 + + + + + + 60 

PR* G MEPHMSECCEQLEGMD 



GAGAGCTGCAGATGCGAAGGCTTAAGGATGATGATGATGAGGATGCAACAGGAGGAGATG 
40 ESC R C VTTT H* m"V M*V VT T'e'TV 



121 +.- + + + + ... . ,„ n 

45 QPR GEQMRRMMRLAENIP "s" R 

TGCAACCTCAGTCCCATGAGATGCCCCATG 

c _ 181 + + + 210 

50 C NLSPMRCPM 



CAACCCCGAGGGGAGCAGATGCGAAGGATGATGAGGCTGGCCGAGAATATCCCTTCCCGC 
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CHART 10 fcontlnuari) 



(c) Nucleotide Sequenc of a Synthetic Brazil Nut 12 kD Subunit 
of the Sulfur-Rich Seed Storage Protein Gene (BZ-3) 



10 CAGCAGATGCTCAGCCACTGCCGGATGTACATGAGACAGCA6ATGGAGGAGAGCCCGTAC 

1 + + + + + + 60 

QQM LSHCRMYMRQQKEESPY 

15 

CAGACCATGCCCAGGCGGGGAATGGAGCCGCACATGAGCGAGTGCTGCGAGCAGCTGGAG 

61 - . 

QTM PRRGMEPHMSEC C *E Q L E + 

20 

GGGATGGACGAGAGCTGCAGATGCGAAGGCTTAAGGATGATGATGATGAGGATGCAACAG 
121 + +-- + + + + 180 

GMDESC RCEGLRMMMMRMQQ 

25 

GAGGAGATGCAACCCCGAGGGGAGCAGATGCGAAGGATGATGAGGCTGGCCGAGAATATC 

181 — I .4 ._.._.._.| | ^ _ ^ 240 

EEM QPRGEQMRRM mTl'Ve'n "i 

30 

CCTTCCCGCTGCAACCTCAGTCCCATGAGATGCCCCATG 

241 + ♦ + 279 

PSRCNLSPMRCPM 

35 
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CHART 11 

Ncol 

' - AATTATC CATGG ATGGAGCCGCACATGAGCGAGTG- 3 ' 
5' oligomer primer 

Ncol 

_L_ 

5 ' - AATTCTCC ATGG CATCATCCTTCGCATCTGCTC - 3 ' 
3' oligomer primer 



Ncol 

5' -AATTCACCATGG CCAGGCGGGGAATGGAGCCGC-3 ' 
5* oligomer primer 

Ncol 
1 

5 ' -AATTCACCATGG CATGGGGCATCTCATGGGACTG - 3 f 
3' oligomer primer 



Ncol 



5 ' -ATTCACCATGG CAGCAGATGCTCAGCGACTGCC-3 ' 
5' oligomer primer 



Ncol 

5 ' -AATTCACCATGG CATGGGGCATCTCATGGGACTG - 3 ' 
3' oligomer primer 
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CHART \ Z 

Phaseolin Expression Cassettes Using 
5 Brazil Nut Gene Fragments 



(a) Hindlll Ncol Ncol Hindlll 

10 * 1 'I | | * 

pvP > pvSP > BZ1 > pvTT > pvS > 

pUC18pvPpvSPBZ- lpvTTpvS 



15 



20 



(b) Hindlll Ncol Ncol Hindlll 

* L- 1 I | * 

pvP > pvSP > BZ2 > pvTT > pvS > 

pUC18pvPpvSPBZ- 2pvTTpvS 



25 (c) Hindlll Ncol Ncol Hindlll 



* 



pvP > pvSP > BZ3 > pvTT > pvS 



* 



30 



pUC18pvPpvSPBZ-3pvTTpvS 
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CHART 13 



Glycine Expression Cassettes Using 
Brazil Nut Gene Fragments 



10 



15 



20 



(a) Hindlll 



Ncol 



Ncol 



Hindi II 
1_* 



gmP > gmSP > BZ1 > pvTT > pvS > 

pUC18gmPgmSPBZ- IpvTTpvS 



(b) Hindlll 



Ncol 



Ncol 
I 



gmP > gmSP > B22 ----> pvTT > pvS 

pUC18gmPgmSPBZ- 2pvTTpvS 



Hindlll 



— > 



25 (c) 



30 



Hindlll 
* -L 



Ncol 
-L 



Ncol 



gmP > gmSP > BZ-3 > pvTT 

pUC18gmPgmSPBZ- 3pvTTpvS 



Hindlll 



-> pvS > 
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CHART 14 



Zein Expr ss ion Cassettes Using 
Brazil Nut Gene Fragments 



(a) Sail Ncol Ncol Sail 

10 *_1 J | j_* 

zmP > zmSP > BZ-1 > zmTT > zmS > 



15 



20 



pUC18zmPzmSPBZ-lzmTTzmS 



0>) Sail Ncol Ncol Sail 

* 1 1 I l_* 

zmP > zmSP > BZ-2 > zmTT > zmS > 



pUC18zmPzmSPBZ- 2zmTTzmS 



25 (c) Sail Ncol Ncol Sail 

*— I 1 I L_* 

zmP > zmSP > BZ-3 > zmTT > zmS > 



30 



pUC18zmPzmSPBz-3zmTTzmS 
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CHART 15 

Phaseolin Expression Cassettes Using BST Gene 

Ncol 
— U 

5' - AATTCAACCATGG GCTGCAGGCCCCCGGACCTCCC-3' 

5' oligomer primer 

Ncol 
— l_ 

5 ' - AAATTC AAAGCTT GCATCCCCAGCATGCCTGCTATTG- 3 ' 
3* oligomer primer 

Hindi I I Ncol Hindi I I 

J— 1 1 

pvPbvSPBSTbvTTbvS 

Hindlll Ncol Hindm 

*— I 1 I * 

pvP ---> bvSP ---> BST ---> bvTT — -> bvS — > 

pUC18pvPbvSPBSTbvTTbvS 
CHART 16 

Glycine Expression Cassettes Using BST Gene 



Hindlll Ncol Hindlll 
-I 1 1 

gmPbvS PBSTbvTTbvS 



Hindlll Ncol Hindlll 

* L 1 _ | * 

gmP ---> bvSP --> BST bvTT ---> bvS ---> 



pUC18gmPbvSPBSTbvTTbvS 
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CHART 17 

Phaseolin Expression Cassettes Using BST Gene 
Ncol 



5' - AAATTACCATGG GCATCCCCAGCATGCCTGCTATTG - 3 ' 
3' oligomer primer 

Ncol Ncol 
J -1 

bvSPBSTbvTTbvS 

Hindlll Ncol NcoI Hindlll 

*— J 1 I | * 

pvP ---> bvSP — > BST ---> bvTT ---> bvS ---> pvS ---> 

pUCl 8 pvPbvS PB STbvTTbvSpvS 
CHART 18 

Glycine Expression Cassettes Using BST Gene 



Ncol NcoI 
J 1 

bvSPBsTbvTTbvS 



Hindi 1 1 Ncol NcoI HindI j j 

* 1 1 I 1 

gmP bvSP --> BST --> bvTT -> bvs --> pvS - -> 

pUC18gmPbvSPBSTbvTTbvSpvS 
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CHART 19 



Phaseolin Expression Cassettes Using BST Gene 
Ncol 



5 ' - AATTCACCATGG GGGGGAGGGGCAAACAACAGATGG- 3 ' 
3' oligomer primer 

Ncol Ncol 



pvSPBSTbvTT 

Hindlll Ncol Ncol Hindlll 

*— 1_ L 1_ L_* 

p V p > bvSP > BST > bvTT > pvS > 

pUC18pvPbvSPBSTbvTTpvS 
CHART 20 

Glycine Expression Cassettes Using BSt Gene 



Ncol Ncol 

J I 

pvSPBSTbvTT 



Hindlll Ncol Ncol Hindlll 

* — l 1 I I 

gmP ---> bvSP---> BST ---> bvTT ---> pvS ---> 



pUC18gmPbvSPBSTbvTTpvS 



VUS91/00887 



-47- 

CHART 21, 

Phaseolin Expression Cassettes Using BST Gene 
Ncol 



5 ' -AATCATCCATGG TTCCCAGCCATGTCCTTGTCCGGCC - 3 ' 
5' oligomer primer 

Ncol 

5 ' -AATCACCATGG GGGGGAGGGGCAAACAACAGATGG - 3 ' 
3' oligomer primer 



Ncol NcoI 

J— 1 

BSTbvTT 



HindHI Hcol KcoI HindHI 

* 1 J . | 

pvP ---> pvSP ---> BST ---> bvTT — > pvS ---> 

pUC18pvPpvSPBSTbvTTpvS 
CHART 22 

Glycine Expression Cassettes Using BST Gene 



Nc °* Ncol 
i 1 

BSTbvTT 



HindHI Ncol NcoI HindHI 

* 1 J ~ I I * 

gmP ---> gmSP ---> BST ---> bvTT ---> pvS ---> 

pUCl 8 gmPgmS PBSTbvTTpvS 
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CLAIMS 

We claim: 

1. A seed specific expression cassette having gene regulatory 
elements comprising: 

a) a promoter selected from the group consisting of phaseolin 
promoter, a'-subunit of 0-conglycinin promoter, and 0-zein promoter; 

b) a translation initiation signal selected from the group 
consisting of phaseolin translation initiation signal, a'-subunit of 
0-conglycinin translation initiation signal, 0-zein translation 
initiation signal, and animal gene translation initiation signal; 

c) a gene selected from the group consisting of an animal gene 
and a gene from brazil nut 2S storage protein; 

d) a poly-A signal selected from the group consisting of a 
15 phaseolin poly-A signal and an animal gene poly-A signal; 

wherein the promoter is upstream and operably linked to the initia- 
tion signal, the initiation signal is upstream and operably linked to 
the gene, and tfie gene is upstream and operably linked to the poly(A) 
signal; and provided that when the gene regulatory element is ampli- 
fied from genomic or synthetic DNA the regulatory region is flanked 
by a restriction enzyme recognition site selected from the group 
consisting of Ncol, Hindlll, and Sail; and further provided when the 
translation initiation signal is from phaseolin the promoter is from 
phaseolin, and when the translation initiation signal is from the cr'- 
25 subunit of 0-conglycinin the promoter is from the a'-subunit of 

conglyctnin, and when the gene is from brazil nut the promoter and 
translation initiation signal are derived from the same genome. 



20 



30 



35 



2. A seed specific expression cassette according to Claim 1 wherein 
the promoter is from phaseolin, the remaining regions are from bovine 
somatotropin, and further comprising a SV40 promoter region upstream 
and operably linked to the phaseolin region. 

3. A seed specific expression cassette according to Claim 1 wherein 
the gene is an animal gene from bovine somatotropin and includes the 
translation initiation signal. 

4. A seed specific expr ssion cassette according to Claim 3 wherein 
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the promoter is from phaseolin, the remaining elements are from 
bovine somatotropin, the regions linking both the promoter and the 
poly-A elements to the plasmid each code for Hindlll, and the regions 
linking the promoter to the translation initiation signal codes for 
5 Ncol . 

5. A seed specific expression cassette according to Claim 3 wherein 
the promoter is from the a' -subunit of /9-conglycinin, the remaining 
elements are from bovine somatotropin, the regions linking both the 

10 promoter and the poly-A elements to the plasmid each code for 

Hindlll, and the region linking the promoter to the translation 
initiation signal codes for Ncol. 

6. A seed specific expression cassette according to Claim 3 wherein 
15 the promoter and the poly- A signal are from phaseolin, the remaining 

elements are from bovine somatotropin, the region linking both the 
promoter and the poly-A elements to the plasmid each code for 
Hindlll, and the region linking the promoter to the translation 
initiation signal and the gene to the poly-A signal code for Ncol. 

20 

7. A seed specific expression cassette according to Claim 3 wherein 
the promoter and the poly-A signal are from the a' -subunit of 0-con- 
glycinin, the remaining elements are from bovine somatotropin, the 
region linking both the promoter and the poly-A elements to the 

25 plasmid each code for Hindlll, and the region linking the promoter to 
the translation initiation signal and the gene to the poly-A signal 
code for Ncol. 

8. A seed specific expression cassette according to Claim 3 wherein 
30 the promoter, translation initiation signal, and poly-A signal are 

from phaseolin, the region linking both the promoter and the poly-A 
elements to the plasmid each code for Hindlll, and the region linking 
the translation initiation signal to the gene and the gene to the 
poly-A signal code for Ncol. 

35 

9. A seed specific expression cassette according to Claim 3 wherein 
th pr moter is from phaseolin, the translation initiation signal and 
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gene re from bovine somatotropin, and having a first poly- A signal 
from bovine s matotropin upstream and operably linked to a second 
poly-A signal from phaseolin, the region linking both the promoter 
and the second poly-A elements to the plasmid each coding for 
5 Hindlll, and the region linking the promoter to the translation ' 
initiation signal and the first poly-A signal to the second poly-A 
signal coding for Ncol. 

10. A seed specific expression cassette according to Claim 3 wherein 
10 the promoter Is from the o'-subunit of 0-conglycinin, the translation 

initiation signal and gene are from bovine somatotropin, and having a 
first poly-A signal from bovine somatotropin upstream and operably 
linked to a second poly-A signal from phaseolin, the region linking 
both the promoter and the second poly-A element to the plasmid each 
15 coding for Hindlll, and the region linking the promoter to the 
translation initiation signal and the first poly-A signal to the 
second poly-A signal coding for Ncol. 

11. A seed specific expression cassette according to Claim 1 com- 
20 prising: 

a) a promoter selected from the group consisting of phaseolin 
promoter, o'-subunit of £-conglycin±n promoter, and fi-zein 15K 
promoter . 

b) a translation initiation signal selected from the group 

25 consisting of phaseolin translation initiation signal, o'-subunit of 
0-conglycinin translation initiation signal, and />-zein 15K transla- 
tion initiation signal; 

c) a gene selected from the group consisting of brazil nut gene 
and brazil nut gene subunit; 

d) a translation termination signal selected from the group 
consisting of phaseolin translation termination signal and 0-zein 15K 
translation termination signal; 

e) a poly-A signal selected from the group consisting of phaseo- 
lin poly-A signal and £-zein 15K poly-A signal; 

wherein the promoter is upstream and operably linked to the initia- 
tion signal, the initiation signal is upstream and operably linked to 
the gene, the gene is upstream and operably linked to the translation 
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10 



20 



25 



30 



35 



termination signal, and th translation termination signal is 
upstream and operably linked to the poly-A signal, and further 
provided that the gene regulat ry element is flanked by a restriction 
enzyme recognition site selected from the group consisting of Ncol , 
HindHI, and Sail; and further provided that the promoter and 
translation initiation signals are derived from the same genome. 

12. A seed specific expression cassette according to Claim 11 
wherein the gene is BZ. 



13. A seed specific expression cassette according to Claim 12 
wherein the promoter, translation termination signal, and poly-A 
signal are from phaseolin, the region 15 >.ing both the promoter and 
poly-A elements to the plasmid each code for HindHI, and the region 

15 linking the promoter to the gene- and the gene to the termination 
signal code for Ncol. 

14. A seed specific expression cassette according to Claim 12 
wherein the promoter is from the o'-subunit of 0-conglycinin. the 
translation termination signal and poly-A signal are from phaseolin, 
the region linking both the promoter and poly-A elements to the 
plasmid each code for HindHI. and the region linking the promoter to 
the gene and the gene to the termination signal code for Ncol. 

15. A seed specific expression cassette according to Claim 12 
wherein the promoter, translation termination signal, and poly-A 
signal are from /J-zein 15K, the region linking both the promoter and 
poly-A elements to the plasmid each code for Sail, and the regions 
linking the promoter to the gene and the gene to the termination 
signal code for Ncol. 

16. A seed specific expression cassette according to Claim 11 
wherein the gene is selected from the group consisting of BZ-1, BZ-2 
and BZ-3. 

17. A seed specific expression cassette according to Claim 16 
wh rein the remaining elements are from phas olin, the region linking 
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b th the promo t r and poly- A elements to the plasmid each code f r 
HIndlll, and the region linking the translati n Initiation signal to 
the gene and the gene to the termination signal code for Ncol. 

5 18. A seed specific expression cassette according to Claim 16 
wherein the remaining elements are from fi-zein 15K, the region 
linking both the promoter and poly-A elements to the plasmid each 
code for Sail, and the region linking the translation initiation 
signal to the gene and the gene to the termination signal code for 
10 Ncol. 

19. A cell comprising a seed specific expression cassette as claimed 
in Claim 1. 

15 20. A seed or plant comprising cells as claimed In Claim 19. 
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